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This paper investigates reversibility properties of 1-dimensional 
3-neighborhood (/-state finite cellular automata (CAs) under pe¬ 
riodic boundary condition. A tool named reachability tree has 
been developed from de Bruijn graph which represents all possi¬ 
ble reachable configurations of an n-cell CA. This tool has been 
used to test reversibility of CAs. We have identified a large set of 
reversible CAs using this tool by following some greedy strate¬ 
gies. Our conjecture is that the reversible CAs, defined over infi¬ 
nite lattice, are always reversible when the CAs are finite. How¬ 
ever, the reverse may not be true. 
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I INTRODUCTION 

A cellular automaton (CA) is a discrete dynamical system which consists of 
a regular network of finite state automata (cells). The cells update their states 
following a local update function of the states of their neighbors. Since their 
invention [44], cellular automata (CAs) have been an area of extensive re¬ 
search for the scientists of various disciplines due to their capability of mod¬ 
elling different physical systems [42, 47, 3], Various properties of them are 
explored by researchers, one of most importance being reversibility property 
(or, invertibility or bijectivity). 
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The reversibility property of a CA refers to that each configuration of the 
CA has a unique predecessor. This implies, there is no loss of informa¬ 
tion during the evolution of the CA. This property has direct correspondence 
with the reversibility of microscopic physical systems, implied by the laws of 
quantum mechanics. The reversible (or bijective) CAs have been utilized in 
different domains, like simulation of natural phenomenon [16], cryptography 
[45, 2, 13], pattern generations [33, 21], pseudo-random number generation 
[46, 7], recognition of languages [22] etc. 

The study on reversibility of CAs was started with Richardson [35], and 
Amoroso & Patt [1], In their seminal paper, Amoroso and Patt provided effi¬ 
cient algorithms to decide whether a one-dimensional CA, debited by a local 
map / is bijective or not. It was later shown that it is not possible to de¬ 
sign an efficient algorithm that tests bijectivity of an arbitrary CA, debned 
over higher dimensional (two or more) lattice [19]. However, the research on 
one-dimensional reversible CAs was continued [34, 36, 25, 26, 43, 30, 28]. 
A decision algorithm for CAs with bnite conbgurations was given in [14], 
Other variants are given in [5, 29], An elegant scheme based on de Bruijn 
graph to decide whether a one dimensional CA is reversible is presented in 
[41], The algorithms of [1, 14, 41], however, deal with inbnite lattice, but 
silent about CAs with bnite number of cells. We have pointed out that some 
of the reversible CAs over a bnite number of cells are wrongly concluded as 
irreversible CAs by those algorithms (see Section V). It may be mentioned 
here that, bnite CAs are the interest of researchers, when the CAs are utilized 
to model some real-life problems. 

Moreover, these algorithms decide only whether a given CA is reversible 
or not. What will happen if one wants to identify a set of reversible CAs in 
one dimension? In fact, it is a practical requirement of researchers who target 
reversible CAs to model their problems. The algorithms of [1] and [41] do 
not directly guide to the solution of this question. One possible way to get the 
set of reversible CAs is, consider a huge set (if not the whole set) of CAs, and 
then use the decision algorithms to pick the reversible CAs. Obviously, this 
is not a practical solution to this problem. 

During last 20 plus years, people working with one-dimensional reversible 
CAs have addressed above two issues in different ways [32, 17, 37, 39, 31]. 
The researchers primarily concentrated on a special class of CAs, the local 
map / of which is linear [2], and they are referred to as linear CAs [ 18, 24, 
4, 12], The reason of choosing this class of CAs is, standard algebraic tools 
can be used to characterize them. It can be noted here that during last 20 plus 
years, reversible CAs are extensively explored and in most of the cases, the 
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CAs are linear [2, 4, 12]. It can further be noted that, the CAs are primarily 
binary [23, 15]. Recently, above two issues are addressed with non-binary 
CAs that have more than two states per cell [4, 12], But these CAs are again 
linear. On the other hand, non-linear reversible CAs are studied in recent 
times, and they are targeted to model some well-known problems [7, 8, 9, 10], 
However, these non-linear CAs are also limited with 2 states per cell. 

In this scenario, we take up this research to develop efficient schemes to 
test reversibility of finite CAs and to identify a set of reversible CAs with 
4-states per cell (4 > 2 ). The CAs are one-dimensional and neighborhood 
dependency is considered as 3. As is well-known after Smith, a CA with 
higher neighborhood dependency can always be emulated by another CA with 
lesser, say 3-neighborhood dependency [ 38 ], Hereafter, by “CA”, we will 
mean one-dimensional 3-neighborhood finite CA with 4 states per cell (4 > 
2 ). The schemes presented in this paper can deal with both linear and non¬ 
linear CAs. 

In this work, we first develop a characterization tool which is named as 
Reachability Tree (Section III). This tool is instrumental in developing the¬ 
ories for finite CAs. We identify the properties of reachability tree when 
it presents a reversible CA (Section IV). Exploring these properties, we de¬ 
velop an algorithm to test reversibility of a finite CA (Section V). Section V 
also points out that algorithm of [1] is not appropriate for testing reversibil¬ 
ity of finite CAs, because some conditionally reversible CAs are erroneously 
concluded as irreversible CAs. We finally report three greedy strategies to get 
a set of reversible finite CAs (Section VI). 

II DEFINITIONS 

In this work, we consider one-dimensional 3-neighborhood CAs with peri¬ 
odic boundary condition where cells of the CA form a ring = Z/nZ, n is 
the length of the CA. That is, the CAs are finite. Each cell can use a set of 
states S = {0,1, • • • , 4 — 1}. The next state of each cell is determined by a 
local rule / : S' 3 —> S. A configuration of a CA at a time t is defined as a 
collection of states of all cells at t. 

According to their global behavior, CAs can be classified as reversible and 
irreversible. In a reversible CA, the initial configuration repeats after certain 
number of time steps. Therefore, all the configurations of a reversible CA are 
reachable from some other configurations and each configuration has exactly 
one predecessor. On the other hand, in an irreversible CA, there are some 
configurations which are not reachable (non-reachable configurations) from 
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any other configuration. Moreover, some configurations of such a CA are 
having more than one predecessors. 

To understand global behavior of CAs, a mathematical tool, named de 
Bruijn graph, is used by various researchers [41, 27, 40], An m-dimensional 
de Bruijn graph of k symbols is a directed and edge-labelled graph represent¬ 
ing overlaps between sequences of symbols. In general, the de Bruijn graph 
(k,m), where k is the number of symbols and to is the dimension, has k m 
vertices and k m+1 edges. The graph is balanced in the sense that each vertex 
has both indegree and outdegree k [11]. The de Bruijn graph can be exploited 
to decide whether a given CA is reversible [41], 

A CA, defined by a local rule /, can be expressed as a de Bruijn graph 
of dimension to — 1, where to is neighborhood size (= 3 in our case) over 
k = IIS'! symbols. So, if S = {0,1, 2}, the graph will have 3 2 = 9 vertices 
and 3 3 = 27 edges. Each edge is labelled with xyz/v where xyz represents a 
sequence of 3 symbols from S which come from the overlap of labels of the 
two nodes of that directed edge and v is the next state value for that edge of 
the rule defined by /. 



Figure 1: The de Bruijn Graph of CA 201210210201210210201210210 

Figure 1 represents a 3-state CA. The graph shows that if the left, self and 
right neighbors of a cell are all 0s, then next state of the cell (that is, /(0, 0, 0)) 
is 0, if the neighbors are 0, 0 and 1 respectively, the next state is 1, and so on. 
The rules can also be expressed by a tabular form. Table 1 represents the 
rule of Figure 1 (rule of 2 nd row). Note that the table has an entry for each 
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value of xyz. In this work, however, we refer each of the edge label xyz 
as Rule Min Term (RMT) because this representation can be viewed as Min 
Term of three variable Switching function. For our convenience, we generally 
represent RMTs by their corresponding decimal equivalents. 

Definition 1 The combination of neighborhood x, y, z with respect to the 
value f(x. y, z), where f : S 3 S is the local rule of a CA, is called Rule 
Min Term (RMT). Each RMT is associated toanumberr = xxd 2 +yxd+z. 
We denote the value f(x , y, z) by f[r]. 


Table 1: Rules of 3-state CAs 



Therefore, the number of RMTs of a d-state CA rule is d 3 . We represent 
a rule by the values of f[r] with / [0] as the right most digit. Now, if the 
graph of Figure 1 is observed, it can be seen that each node has 3 incoming 
edges and 3 outgoing edges. In general, a node of the de Bruijn graph of a 
d-state CA has d incoming edges and d outgoing edges. Therefore, the set 
of incoming RMTs (resp. outgoing RMTs) are related to each other. Note 
that in Figure 1, last (resp. first) 2 digits of any set of incoming RMTs (resp. 
outgoing RMTs) are same. We call the set of incoming RMTs as equivalent 
RMTs, and the set of outgoing RMTs as sibling RMTs. 

Definition 2 A set ofd RMTs n, r^, ■■■, rd of a d-state CA rule are said to be 
equivalent to each other if rid = r 2 d = ... = r<id (mod d 3 ). 

Definition 3 A set of d RMTs Si, S 2 ,Sd of a d-state CA rule are said to 
be sibling to each other if J = = ... = [^J. 

The rationale behind choosing the name equivalent is - if one traverses the 
de Bruijn graph of a d-state CA, then a node can be reached through any one 
of the d incoming edges, hence all edges are equivalent with respect to the 
reachability of the node. On the other hand, after reaching a node, one can 
keep on traversing the graph by selecting any of the outgoing edges, to which 
we name sibling, because they are coming out from the same mother node. 
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We represent Equii as a set of RMTs that contains RMT i and all of its 
equivalent RMTs. That is, Equii = {*, d 2 + i, 2d 2 + 1, ■ • • ,(d— 1 )d 2 + i}, 
where 0 < i < d 2 — 1. Similarly, Siblj represents a set of sibling RMTs 
where Siblj = {d.j, d.j + 1, ■ ■ ■ , d.j + d — 1} (0 < j < d 2 — 1). However, 
one can observe an interesting relation among RMTs during traversal of the 
graph. In Figure 1, if RMT 1, (or RMT 10 or RMT 19) is used to visit a 
node, then either RMT 3 or RMT 4 or RMT 5 is to be used to proceed further 
traversal. Table 2 shows the relationship among the RMTs of 3-state CAs. In 
general, if RMT r £ Equii of a d-state CA is used to reach a node, then the 
next RMT to be chosen is s £ SibU. 

Table 2: Relations among the RMTs for 3-State CA 


Incoming 

Outgoing 

#Set 

Equivalent RMTs 

Decimal Equivalent 

#Set 

Sibling RMTs 

Decimal Equivalent 

Equio 

000, 100, 200 

0, 9, 18 

Sibl 0 

000, 001,002 

0, 1,2 

Equii 

001, 101,201 

1, 10, 19 

Sibli 

010,011,012 

3,4,5 

Equii 

002, 102, 202 

2, 11,20 

Sibh 

020, 021,022 

6, 7,8 

Equiz 

010, 110,210 

3, 12,21 

Sibh 

100, 101, 102 

9, 10, 11 

Equii 

Oil, 111,211 

4, 13, 22 

Sibh 

110, 111, 112 

12, 13, 14 

Equi& 

012, 112,212 

5, 14, 23 

Sibh 

120, 121, 122 

15, 16, 17 

Equie 

020, 120, 220 

6, 15, 24 

Sibl 6 

200, 201,202 

18, 19, 20 

Equii 

021, 121,221 

7, 16, 25 

Sibl T 

210,211,212 

21,22, 23 

Equi% 

022, 122, 222 

8, 17, 26 

Sibh 

220, 221,222 

24, 25, 26 


The next configuration of a given configuration can also be found by travers¬ 
ing the de Bruijn graph. Following example illustrates this. 

Example 1 Let us take the configuration 1012 of 4-cell CA of Figure 1. To 
get the next configuration of 1012, we form a 2-digit overlapping window and 
start from node 10 as the first two digits of 1012 are 10. From node 10, we 
use edge 101 and go to node 01, then from it following edge 012, we go to 
node 12; from node 12, we go to node 21 by the edge 121 and finally, from 
node 21, we come back to our starting node 10 by the edge 210. For each 
of the edges we traverse, we get a next state value. By these next states, we 
get the next configuration as 1200. The traversal is shown by dotted arrow in 
Figure 1. 

Ill THE REACHABILITY TREE 

In this section, we develop a discreet tool, we call it Reachability tree, for an 
n-cell d-state CA. The tree enables us to efficiently decide whether a given 
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n-cell CA is reversible or not. Moreover, it guides us to identify reversible 
CAs. Reachability Tree was initially proposed for binary CAs [9], which is 
generalised here for d-state CAs. 

To test reversibility of a CA, de Bruijn graph may be utilized. In [41], a 
scheme based on de Bmijn graph was developed to test reversibility of CAs 
with infinite lattice size. However, for finite CAs, a straight forward scheme 
of testing reversibility can be developed - consider each of the possible config¬ 
urations, find next configuration using de Bruijn graph. If each configuration 
is reachable and has unique predecessor, declare the CA as reversible. 

Finding of next configuration of a given configuration using de Bruijn 
graph is simple, and can be done in 0{n) time, where n is the size of configu¬ 
ration (see Example 1). However, finding of next configuration of all possible 
configurations of an n-cell CA is an issue. The de Bruijn graph does not 
directly give any information about the existence of non-reachable configura¬ 
tions. 

Reachability tree, on the other hand, depicts the reachable configurations 
of an n-cell CA. Non-reachable configurations can directly be identified from 
the tree. The tree has n+1 levels, and like de Bruijn graph, edges are labelled. 
However, here the labels generally contain more than one RMT. A sequence 
of edges from root to leaf represents a reachable configuration, where each 
edge represents a cell’s state. 

Definition 4 Reachability tree of an n-cell d-state CA is a rooted and edge- 
labelled d-ary tree with (n+1) levels where each node N t _j (0 < i < n, 0 < 
j < <T - 1) is the collection of d 2 sets of RMTs, and the root N 0 0 is the 
collection of all sets of sibling RMTs. We denote the edges between Ni j (0 < 
i < n — 1, 0 < j < d l — 1) and its possible d children as Ei. d j +m = 
(Ni.j, N i+ i. dj+m ,li. dj+m ) where li. d j+m is the label of the edge and 0 < 
to < d — 1. Like nodes, the labels are also collection of d 2 sets of RMTs. 
Let us consider that V p Ni - j is the p th set of the node Nij, and r q E, dj+m is 
the q th set of the label on edge Ei. d j +m , (0 < p,q < d 2 — 1), So, Ni.j = 

( Fp Ni j )o<p<cP-% andU. d j +m = (T q Ei dj+m ) 0 < q < d 2 _ 1 . Following are the 
relations which exist in the tree : 

1. [For root] N 0 . 0 = (T^ 0 - 0 ),,<*<*_!, where T k N ° ° = Sibl k . 

2. Vr G Tk Ni ' j , r is included inTk Ei ' dj+rn , if f[r] = to, Vto = 0 , 1, • • • ,d— 
1, where f is the rule of the CA. 

3. r k Ni -i = Y k Ei dd u r k E ^+ i u • • • u r k Ei - d w-v (o < k < d 2 - 1). 
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4. \/r, if r £ r k Ei dj+rn , then RMTs d.r (mod d 3 ), d.r+1 (modd 3 ),--- , d.r+ 
(d — 1) (mod d 3 ) are in Y k Ni+ldj+m (0 < m < d — 1). 

5. [Forlevel n-2]T N ”- 2 -i k+i . d ( mod &) C {*, i+d, i+2d, *+3d, • • ■ ,i+ 

(d 2 — l)d}, 0<i<d — l,0<fe<d — 1,0 <j< d n ~ 2 — 1. 

6. [For level n—l]T k Nn ~ 1:i C {k+ixd 2 | 0 < i < d— 1}(0 < k < d 2 — 1). 

Note that, the nodes of level n — 2 and n — 1 are different from other 
intermediate nodes (Points 5 and 6 of Definition 4). Only a subset of selective 
RMTs can play as T k Ni - j in a node Ni.j, (0 < k < d 2 — 1,0 < j < d l — 1) 
when i = n — 2 or n — 1. In fact, for any node at level n — 2 or level n — 1, 
only j of the RMTs that are possible if the node be an intermediate node, can 
exist. Finally, we get the leaves with T k Nn j , where T k Nn j is either empty or 
a set of sibling RMTs. Note that, T^ 0 0 is a set of sibling RMTs (Point 1 of 
Definition 4) and I V V "'° = (J , J • However, in our further discussion we 
shall not explicitly define i and j of node AT , or edge AT , if they are clear 
from the context. 

Example 2 Reachability tree of a 4-cell CA with 3 states per cell is shown in 
Figure 2. As it is of 3 states, a node AT, can have at most 3 children - 
Ni + i, 3 j + i and iVj+i. 3 ^+ 2 - Hence, maximum number of nodes possible in the 

o4+1 -i 

tree for a 4-cell 3-state CA is 3 _~ = 121. Figure 2, however contains 

105 nodes. The root of the tree is Aq.o = (r^° °)o<fc<8> where T ^ 0,0 is a 
set of sibling RMTs ( Sibl k ). Note that the root is independent of CA rule (it 
depends only on d, the number of states per cell), whereas other nodes are 
rule dependent. However, an arbitrary RMT can not be a part of nodes of 
level n — 2 and n — 1 . For example, F ^ 21 of the node N 2.1 (that is, AT- 2 . 1 ) 
of Figure 2 is {18}. If we follow point 4 of Definition 4, then RMTs 19 and 
20 should also be part of T But, they could not, because the node is at 
level n — 2. Similarly at level 3, for node AT. 3 , in the set F | /V: -' 3 only RMT 
1 is present. So, only } of the possible RMTs can be part of any set at level 
n — 2 or n — 1. 


Reachability tree gives us information about reachable configurations of 
the CA. However, some nodes in a reachability tree may not be present, which 
we call non-reachable nodes, and the corresponding missing edges as non- 
reachable edges. No RMT is present in a non-reachable node or in the label 
of a non-reachable edge. 
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Definitions A node is non-reachable if {J 0<k<d2 _ i r k '" J = 0. Similarly, 
an edge is non-reachable if Uo<fc<d 2 -i ^k' 3 = 

The edges of the tree associate the states of CA cells, and a sequence of 
edges from the root to a leaf represents a reachable configuration. Since d 
number of edges can come out from a parent node, we call the left most edge 
as 0-edge which represents state 0, second left most edge as 1-edge which 
represents state 1, and so on. The right most edge represents state d — 1. 

Definition 6 An edge Eij is calledm-edge if f[r] =m,r £ Uo<fc<d 2 -i Fjf ‘' 3 , 
where 0 < m < d 1, and f is the CA rule. 

Therefore, the sequence of edges (Eo,j 1 , E 3 ,j 2 ,E n _i.j n ), where 0 < 
ji < d l — 1,1 < i < n, represents a reachable configuration. 

Example 3 In Figure 2, there are some non-reachable edges - E 3 i, E 32 ,E 33 , 
E :i (, etc of which the labels are empty. Corresponding nodes N 3 . 3 , N 3 . 2 , -/V3.3, 
A 3.6 etc are non-reachable nodes. However, the sequence (A’o.o, A’i . 1 , T’ 2 . 3 , 
£- 3 . 11 ), represents the reachable configuration 0102. 

IV REACHABILITY TREE AND REVERSIBLE CAS 

This section studies the reachability tree of reversible CAs. These studies are 
utilized in Section V and Section VI. 

Definition 7 A rule is balanced if it contains equal number ofRMTs for each 
of the d states possible for that CA; otherwise it is an unbalanced rule. 

Example 4 Rule 201210210201210210201210210 is balanced, because the 
rule contains nine 0s, nine Is and nine 2s. 

Theorem 1 : The reachability tree of a reversible CA is complete. 

Proof : : Since all the configurations of a reversible CA are reachable, the 
number of leaves in the reachability tree of an n-cell d-state CA is d" (number 
of configurations). Therefore, the tree is complete as it is a d-ary tree of 
(n + 1) levels. □ 

The above theorem points to the fact that the identification of a reversible 
CA can be done by constructing the reachability tree of the CA. If there is 
no non-reachable edge in the reachability tree, then the CA is reversible. Fol¬ 
lowing theorem further characterizes the reachability tree of a reversible CA. 
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Theorem 2 : The reachability tree of a d-state CA is complete if and only if 

i) The label for any j, contains only one RMT, i-e. | Uo<fc<d 2 -i j | = 

1. 

ii) The label l n _ 2 .j, for any j, contains only d RMTs, Le-,\ Uo<fc<d 2 -i Tk En ~ 2:i | = 

d. 

iii) Each other label Ipj contains d 2 RMTs, i.e. | Uo<fc<d 2 -i |= d 2 , 

where 0 < i < n — 3. 

Proof: : 

For “if ” Part: 

Let us consider, the number of RMTs in the label of an edge is less than 
that is mentioned in (i) to (iii). That means, 

(i) There is no RMT in the labelZ n _i .j, for some j. That is, Uo<fc<d 2 -i ^ = 

0. It implies, the tree has a non-reachable edge and so, it is incomplete. 

(ii) The label l n - 2 .j contains less than d RMTs, for some j. That is, | 

Uo<fc<d 2 -i f'k En ~ 2 5 |< d — 1. Then, the number of RMTs in the node 
N n —\ j < d(d — 1). Since the node is at level (n — 1), only ^ of the RMTs 
are valid according to the Definition 4. So, the number of valid RMTs is 
— = ~ 1)’ which implies that the maximum number of possible 

edges from the node is d — 1. Hence, there is at least one non-reachable edge 

E n - 1.6 for which Uo<fc<d 2 -i r k Erl ~ l b = 0- 

(iii) Say, each other label U.j contains less than d 2 RMTs, i.e. j Uo<fc<d 2 -i ^k Ei i |< 
d 2 , (0 < i < n — 3). Then, | Uo<fe<d 2 -i rfc Afi+1J |< d 3 . Here, the node 

Ni+i.j may have d number of edges. In best case, the tree may remain com¬ 
plete up to level (n — 2). Then there exists at least one edge E n _ 3 p , for 
which | U 0 <fe<d 2 -i r fe' E " _3 ' P l< rf2 - which makes a node iV n _ 2 . p where 
| (Jo</c<d 2 -i Tk Nn ~ 2 p |< d 3 . Since the node is at level (n — 2), it has max¬ 
imum 'EA-zTl = (J2 _ ]_) valid RMTs. This implies, there exists at least 
one edge, incident to N n _ 2 .p , for which | (Jo<fc<d 2 -i Tk En ~ 2 q \< d, which 
makes the tree an incomplete one by (ii). 

On the other hand, if for any intermediate edge Ei j 1 , \ Uo<fc<d 2 -i jl |> 
d 2 , then an edge 2 can be found at the same label i for which | Uo<fc<d 2 -i r k Ei j2 |< 
d 2 , where 0 < i < n 3, and j-\ / j 2 . Then, by (iii), the tree is in¬ 
complete. Now, if for any p, label l n - 2 . P contains more than d RMTs, then 
also there exists an edge E n _ 2 .q for which | Uo<fc<d 2 -i Tk En ~ 2 q \< d. 

Hence, the tree is incomplete (by (ii)). Similarly, if for an edge E n _ i. m , 
Uo<fc<d 2 -l r fc E — lm |> 1, then also the tree is incomplete. Therefore, if 
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the number of RMTs for any label is not same as mentioned in (i) to (iii), the 
reachability tree is incomplete. 

For “ Only if ” Part: 

Now, let us consider that, the reachability tree is complete. The root N 0 0 
has d 3 number of RMTs. Now, these RMTs have to be distributed so that 
the tree remains complete. Let us take that, any edge LVj, has less than 
d 2 RMTs, another edge /i’o j2 has greater than d 2 RMTs and other edges 
E 0 .j, (0 < j,ji,j 2 < d - 1 k jx ^ j 2 £ j) has d 2 RMTs. Then node N 1 , jl 
has less than d 3 RMTs, N- t j2 has greater than d 3 RMTs and other edges y 
has d 3 RMTs. The tree remains complete at level 1. Now, this situation may 
continue upto level n — 2. At level (n — 2), only ^ of the RMTs are valid (see 
Definition 4). So, the nodes with less than d 3 RMTs has at maximum d 2 1 
valid RMTs and so on. For such nodes at level n — 2, there exists at least one 
edge E n _ 2 . p , such that | Uo</c<d 2 -i F k En ~ 2 ' p |< d for which the tree will 
have non-reachable edge (by (ii)). The situation will be similar if any number 
of intermediate edges have less than d 2 RMTs implying some other edges at 
the same level have more than d 2 RMTs. Hence the tree will be incomplete 
which contradicts our initial assumption. So, for all intermediate edges E, 7 , 

I Uo<fc<d 2 -i T k Ei j |= d 2 , where 0 < i < n - 3. 

Now, if this is true, then at level n — 2, the nodes have d 3 number of RMTs 
out of which d 2 are valid. If an edge L n - 2 .p has less than d RMTs, then 
the node N n -i. p has at maximum d(d — 1) RMTs out of which only d — 1 
are valid. Hence, at least one edge, incident to iV n _i. p , is non-reachable 
making the tree incomplete. Similar thing happens if there exist more edges 
like E n _ 2 .p- So, each edge label l n - 2 .j, must have d RMTs. In the same way, 
each of the edge labels l n -i.j, for any j , are to be made with a single RMT 
to make the tree complete. Hence the proof. 

□ 

Corollary 1 : The nodes of a reachability tree of a reversible CA contains 

1. d RMTs, if the node is in leveln orn—1, i-e. \ {J Q<k<d 2 _ 1 Tk Nij \= d 
for any j, when i = n or n — 1. 

2. d 2 RMTs, if the node is at level n — 2 i.e \ Uo<fc<d 2 -i T k Nrl ~ 2j |= d 2 
for any j. 

3. d 3 RMTs for all other nodes i.e Nij, \ Uo</c<d 2 -i ^k Ni j |= d 3 , 
where 0 < i < n — 2. 
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Proof : : This is directly followed from Theorem 2, because for each RMT 
on an edge E i;] , d number of sibling RMTs are contributed to N l+ - t 

□ 

Like CA rules, we classify the nodes of a reachability tree as balanced and 
unbalanced. 

Definition 8 A node is called balanced if it has equal number of RMTs cor¬ 
responding to each of the d-states possible; otherwise it is unbalanced. So, 
for a balanced node, number of RMTs with next state value Os = number of 
RMTs with next state value Is = ■ ■ ■ = number of RMTs with next state value 
(d — l)s. 

Therefore, the root of the reachability tree of a balanced rule is balanced, 
because number of RMTs associated with each of the d states is d 2 . 

Lemma 1 The nodes of the reachability tree of a reversible CA are balanced. 

Proof : : Since, the reachability tree for a reversible (/-state CA is complete, 
so, each node has d number of edges i.e d number of children. Since a node 
Ni.j contains d 3 RMTs when 'i. < n — 2 (Corollary 1) and an edge Ei+ i.fc, 
for any k, contains d 2 RMTs (Theorem 2), the node N,^ is balanced. Here, 
number of RMTs, associated with same next state values, is d 2 . Similarly, the 
nodes of level n — 2 and n — 1 are balanced. 

□ 


Theorem 3 : A CA with unbalanced rule is irreversible. 

Proof : If the rule is unbalanced, then it has unequal number of RMTs 
corresponding to each state. That means, the root node N 0 0 is unbalanced. 
Therefore, there exists an edge E^.j where Uo<*:<d 2 -i J < d 2 {0 < j < 
d 2 — 1). Hence the CA is irreversible by Theorem 2. 

□ 

However, the CAs with balanced rules can not always be reversible. Fol¬ 
lowing example illustrates this. 

Example 5 Consider the CA 201012210201012210201012210 of Figure 2. 
The rule has 9 RMTs for each of the states 0,1, and 2, so, it is balanced. Each 
node Ni.j, when i < n — 3 = 1 contains 27 RMTs, each node at level n — 2 
i.e N 2 .j contains 9 RMTs and each node of level n — 1 i.e. contains 3 
RMTs. However, all the nodes of level n, i.e. Ai 1y , 0 < j < 3 1 - 1 do not 
contain 3 RMTs; for example, the nodes A r 10 . Aij 5 , N 4.20 etc consist of 9 
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RMTs, but the nodes -ZV 4 . 1 , A 4 . 5 . /V 4 . 9 . A^.ig etc are empty. So, the CA does 
not satisfy Corollary 1 and it is irreversible. 

Depending on the theoretical background developed in this section, we 
now identify reversible d-state CA in the next section. 


V DECISION ALGORITHM FOR TESTING REVERSIBILITY 

The simplest approach of testing reversibility of an n-cell CA is, develop 
the reachability tree of the CA starting from root, and observe whether the 
reversibility conditions given by the theorems 1 and 2 are satisfied for the 
given rule or not. If there is any such node/ edge that does not satisfy any 
of these conditions, then the CA is irreversible , otherwise it is a reversible 
CA. The problem of this approach is that if the CA is reversible then the tree 
grows exponentially, so when n is not very small, it is difficult to handle the 
CA with this approach. However, we have following two observations - 

1. If Ni j = Ni^ when j ^ k for any i, then both the nodes are roots of 
two similar sub-trees. So, we can proceed with only one node. Sim¬ 
ilarly, if li.j = li.k (i 7 ^ fc), then also we can proceed with only one 
edge. 

2. If Ni.j = Ni'.k when i > i' , (0 < i, i' < n — 3), then the nodes that 
follow k are similar with the nodes followed by N, :r Therefore, we 
need not to explicitly develop the sub-tree of N^j. If this case happens, 
then we have to be careful only when we develop the levels n — 2 and 
ra — 1. It is observed that after few levels, no unique node is generated. 
So, for arbitrary large n, we need not to develop the whole tree. 

Following above two observations, we can develop minimized reachability 
tree which does not grow exponentially. In fact, very few nodes are generated 
in such minimized reachability tree. In the proposed algorithm, minimized 
tree is developed and in each step it is verified whether the conditions of 
reversibility are satisfied or not. 

The algorithm takes a ci-state CA rule and the number of cells (n) as input 
and outputs “Irreversible” if the n-cell CA is irreversible and “Reversible” 
otherwise. In the beginning, the algorithm checks whether the input CA is 
balanced. If not, it decides the CA as irreversible. Otherwise, the algorithm 
starts developing the reachability tree. In each step of the tree development. 
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we verify whether the conditions of Corollary 1 is followed. If not, we declare 
the CA as reversible. 

However, after getting the nodes of a level, the algorithm always removes 
duplicate nodes following point 1 of the above observation. The algorithm 
further verifies whether the remaining nodes follow point 2 of the above. If 
any node does so, the node is treated as processed, and the node points to 
some node of previous level. In this way, we may find that all the nodes are 
marked as processed. If so, we do not generate the nodes of the tree further, 
but jump to level n — 3. We now find all the nodes of level n — 3. This step is 
crucial because we have to keep in mind that some nodes may have pointed to 
other nodes. Finally we develop nodes of levels n — 2 and n — 1, and declare 
the CA as reversible if all the conditions of reversibility are satisfied. 


Example 6 Let us consider a 3-state CA 000111222000111222000111222 
with n = 100 as input. Note that the CA is balanced. Following our algo¬ 
rithm, we get 3 nodes iVi.o, iVi.i and IV 1.2 at level 1 where Uo<fc<8 I\- Wl 0 = 

{18 - 26} (T 2 Nl ° = T 5 n ' ° = T 8 N ' ° = {18 - 26}), U 0 <fe< 8 r*" 1 ’ 1 = 

{9 - 17 } (rA 1 = r 4 JVl - 1 = rA 1 = {9 - 17 }) and U 0 <fe<8 = 

{0 - 8} ( Tq ^ 1 ' 2 = T 3 Nl - 2 = T 6 Nl - 2 = {0 - 8}). For each node, the sets 
whose contents are not mentioned, are empty. At level 2 , we get 9 nodes 
where nodes IV2.0 = iVi.o = {18 — 26 }, IV2.4 = IV1.1 = {9 — 17 } and 
IV2.8 = IV1.2 = {0 — 8}. So, we remove the duplicate nodes N 2 . 0 , IV2.4 and 
N 2 . 8 . The other nodes of level 2 are IV2.1, IV2.2, IV2.3, IV2.5, IV2.6 and N 2 . 7 , 
where U 0 < fc < 8 I^ 2 ' 1 = {9 - 17 } (T 2 N ™ = T 5 N ^ = F^ 21 = {9 - 17 }), 

Uo< fc <8 Tfe^ 2 ' 2 = |0 ^ 8} (T 2 N2 - 2 = T 5 N2 - 2 = r 8 N2 - 2 = {0-8}), 
Uo< fc <8 r fc Ar2 ' 3 = i 18 - 26 } (r^ 2 - 3 = r 4 iV2 - 3 = r 7 iV2 - 3 = {is - 26 }), 
Uo<fc<8 Tfc^ 2 ' 5 = { 0 ^ 8 } CTi^ 2 ' 5 = r 4 iV2 ' 5 = IV* 2 - 8 = {0 - 8}), 
Uo<fc<8 T k N °- 6 = {18 - 26} (To^ 2 6 = Fa^ 2 - 6 = Te^ 2 - 6 = {18 - 26}) and 
U 0 < fc <8 r ^ 2 ' 7 = {9 - 17 } (To* 2 - 7 = Fg^ 2 7 = Te^ 2 - 7 = {9 - 17 }). At 
level 3 , the nodes IV3.3 = IV3.6 = IV2.0, N 3 ^ = IV3.7 = IV2.1, IV3.5 = IV3.8 = 
N 2 . 2 , IV3.9 = IV3.15 = IV2.3, iV3.r0 = IV3.I6 = IV2.4, iV3.11 = iVt.17 = N 2 . 5 , 
-V3.18 = iV3.2i = N2.Q, iV3.19 = iV3.22 = N2.7 and N3.20 = IV3.23 = iV2.8• 
So, all these level 3 nodes are duplicate nodes. We do not process these nodes 
and directly generate the nodes of level n — 3 . We can see that, the nodes 
of level n — 3 are the nodes of level 2. However, we generate the nodes of 
levels n — 2 and n 1. The CA satisfies the reversibility conditions and our 
algorithm reports this CA as “ Reversible” for n = 100 . 
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The proposed algorithm has some similarities with Amoroso & Patt’s al¬ 
gorithm [I] and its variants (see e.g. [14, 39, 29, 5]). Amoroso & Patt’s 
algorithm (and some other algorithms e.g. [14, 5]) develops a tree-like struc¬ 
ture to decide if the input CA is reversible. These trees, like our algorithm, 
do not grow exponentially. Unlike these tree structures, however, the reacha¬ 
bility tree is well debited. The tree is debned here for hnite CAs, whereas the 
algorithms of [1, 14, 39, 29, 5, 6]) are developed for inbnite lattice. Since we 
deal here with hnite CAs, there are many reversible CAs (decided by our al¬ 
gorithm), which are wrongly concluded as irreversible CAs by the other algo¬ 
rithms. However, the converse is not true - that is, CAs decided as reversible 
by Amoroso & Patt’s and others’ algorithms are also concluded as reversible 
by our algorithm. To explore this point, we brieby discuss Amoroso & Patt’s 
algorithm and compare it with ours. 

Amoroso & Patt’s algorithm for has two parts - decision procedure for 
global surjectivity and also for global injectivity. For CAs, global injectivity 
implies global surjectivity, so, injective CAs are themselves bijective or re¬ 
versible [20], In our algorithm, we test reversibility by virtually constructing 
the reachability tree with minimal number of nodes (unique nodes only) and 
check whether or not there is any node that disrespects reversibility condition 
of Corollary 1 . Our algorithm correctly identibes the reversible CAs for bnite 
cell length n, and also the CAs which are reversible for some specibc values 
of n, such as, n is even/odd, n is multiple of 3 etc. But Amoroso & Patt’s 
algorithm [1], which is for inbnite lattice size, fails to identify n dependent 
CAs. Following are two examples : 

Example 7 Let us take a 3-state CA with rule 201210210201210210201210210 
and test reversibility by decision procedure of [1], For this CA, both the 
global surjectivity test and global injectivity test of [ I ] returns the result that 
the CA is surjective and injective, that is, it is a bijective or reversible CA. 
Our algorithm also reports the CA as reversible for any cell length n. 

Example 8 Let us take another 3-state CA with rule 10222101010222101010 
2221010 and test reversibility by decision procedure of [ I ]. We check the CA 
by the global surjectivity testing procedure and bnd that the graph has termi¬ 
nal nodes. Then we test for global injectivity and bnd that for some pairs, the 
sequence sets are the same pair themselves. For example, for the pair (10,3), 
the sequence set is (10, 3 ), where the numbers in the pair are the decimal 
equivalents of the 3-tuples (see Ref. [1] for detail procedure). Thus, the CA 
fails the injectivity test. So, by [1], the CA is irreversible. But actually the 
CA is reversible when its cell length n is a odd number which can be tested 
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by our algorithm. Therefore, the decision procedures of [ l] fails to identify n 
dependent reversible CAs. 


Following the above observations, we believe that all the reversible CAs, 
defined over infinite lattice are also reversible for any value of n, but the 
converse is not true. 

Conjecture 4 The reversible CAs, defined over infinite lattice, are always 
reversible when the CAs are defined over finite number of cells. However, 
the reversible CAs, defined over finite lattice, may not be reversible if lattice 
size is infinite. 


VI IDENTIFICATION OF REVERSIBLE CAS 

This section reports the efficient ways of identifying a set of reversible CAs. 
Since some CAs are reversible only for some particular types of n (the num¬ 
ber of cells), we specify n, when we identify a set of reversible CAs. How¬ 
ever, it can also be noted that, there are many reversible CAs which are re¬ 
versible for all values of n (> 3). Thus, the set of CAs identified in this 
section as reversible, contains both types of CAs - //-independent reversible 
and //-dependent reversible. 

One can, however, intuitively design the following straight forward ap¬ 
proach to get a set of reversible CAs - consider a set of CAs and then use 
our algorithm to select reversible CAs from the set. This trial-and-error ap¬ 
proach is not practical, because most of the CAs are irreversible. So, it is very 
difficult to identify a number of reversible CAs. 

Instead of considering a set of arbitrary CAs, one can repeat the above 
procedure with balanced rules only, because unbalanced rules are always ir¬ 
reversible CAs (Theorem 3). Unfortunately, the number of balanced //-state 
CA rules is priya- (the total number of arrangements of d 3 RMTs where d 
groups of RMTs have same next state value with each group size d 2 (Defini¬ 
tion 7)), and most of the balanced rules are irreversible! Therefore, arbitrary 
choosing of balanced rules for testing reversibility is not very helpful. In this 
scenario, we take greedy approach to choose the balanced rules which are 
potential candidates to be reversible. 

It is pointed out in Section IV that nodes of a reachability tree of a re¬ 
versible CA are balanced (see Definition 8 and Lemma 1). If a rule is bal¬ 
anced, the root which contains all RMTs of the rule, is also balanced. Our 
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greedy approach is, choose the balanced rules in such a way that all the nodes 
up to level n — 3 also remain balanced. Then, use our algorithm to test re¬ 
versibility of the selected balanced rules. Success of this scheme, however, 
remains on how efficiently we are choosing the balanced rules. 

We observe that the equivalent RMTs result in a same set of (sibling) 
RMTs at next level (see Section II). For example, for a 3-state CA, RMT 0 
and RMT 9 are equivalent to each other and both of them produce RMTs 0,1 
and 2 in next level (see Table 2). We exploit this property to develop our first 
greedy strategy. Let us recall that, Equii = { i, d 2 +i, 2 d 2 +i, ..., (d—l)d 2 -t-*} 
is a set of equivalent RMTs where 0 <i < d 2 — 1 . However, our first strategy 
of rule selection is - 

STRATEGY I: Pick up the balanced rules in which equivalent RMTs have 
different next state values, that is, no two RMTs of Equii (0 < i < d 2 — 1) 
have same next state value. 

If we follow STRATEGY /, a label of one edge incident to the root, con¬ 
tains exactly one RMT from Equii, for any i (0 < i < d 2 — 1). This implies, 
the nodes of level 1 contains all the d 3 RMTs of the rule. That is, the nodes 
of level 1 are similar to the root. Hence, the nodes of level 2 are also simi¬ 
lar to the root. This scenario continues until level n — 3. However, we use 
our algorithm to test whether this scenario continues further for levels upto n, 
that is, whether the CA is reversible or not. These types of CAs, however, are 
vibrant candidates to be reversible. 

Example 9 The 3-state CA 222222222111111111000000000 is reversible 
for any value of n. Here equivalent RMTs have different next state values, e.g. 
RMTs 0,9 and 18 are 0,1 and 2 respectively. A 5-state CA 222222222222222 
222222222211111111111111111111111114444444444444444444444444 
33333333333333333333333330000000000000000000000000, having differ¬ 
ent next state values in equivalent sets, is also reversible for any n. 

d 2 

There are (d!) balanced rules that can be selected as candidates follow¬ 
ing STRATEGY I. However, more rules can be selected as candidates if we 
look into sibling RMTs in similar fashion. Recall that, Sibli = {d.i,d.i + 
1, d.i + 2,..., d.i + d — 1} is a set of sibling RMTs, where 0 < i < d 2 — 1. 
It is directly followed from the definition of the reachability tree that all the 
RMTs of Sibli are either present in a node or none of the RMTs is present, 
for any i. That is, no node in the tree (except the nodes of level n — 2 and 
ri—1) partially contains the elements of any sibling set. Keeping in mind this 
property, we develop our next greedy strategy of rule selection - 
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STRATEGY II: Pick up the balanced rules in which the RMTs of a sibling 
set have the different next state values, that is, no two RMTs of Sibli (0 < 
i < d 2 — 1 ) have same next state value. 

If a rule is picked up following STRATEGY II, all the nodes except the 

d 2 

nodes of level n — 2 and n 1 are always balanced. There are ( d \) number 
of such balanced rules. These rules are also good candidates to be reversible 
CAs. Like previous, we use our algorithm to finally decide which of this type 
of rules are reversible. 

Example 10 The 3-state CA 120120210120120210120120210 is reversible 
for any n. In this CA, the sibling RMTs have different next state values. Two 
other 3-state CAs 102120210102120210102120210 and 12002121012002121012 
0021210 are also reversible for odd n. The 4-state CA 012301230123012301230 
1230123012301230123012301230123012301230123 is reversible for any n. 

The 5-state CA 432104321043210432104321043210432104321043210432104321043 
21043210432104321043210432104321043210432104321043210432104321043210 
is also reversible for any n. 


If we select rules following STRATEGY I and STRATEGY II, we will 
be able to identify a large set of /(-cell reversible CAs. Here, the set of rules, 
selected out of STRATEGY / and the set of rules, selected out of STRATEGY 
//are not disjoint. We now report our 3 rd strategy of rule selection where 
the members of Sibli (0 < i < d 2 — 1) have same next state value. 

STRATEGY III : Pick up the balanced rules in which - (1) the RMTs of 
Sibli for each i, have same next state value, and (2) either the RMTs of 
Siblk.d, Siblk.d+i, •••) Siblk.d+d-i have same next state value, where 0 < 
k < d — 1, or RMTs of those d sets have different next state values. 

There are d\ + {d\) d number of balanced rules that can be selected fol¬ 
lowing STRATEGY III as candidates to be reversible CAs. One can easily 
verify that in this case also, all but nodes of level n — 2 and level n — 1 are 
balanced. 

Example 11 The 3-state CA 222111000222111000222111000 is reversible 
for any n. In this CA, the sibling RMTs have same next state values. An¬ 
other 3-state CA 222000111222111000222111000 is also reversible for any 
n, where the sibling RMTs have same next state values. The 4-state CA 
1111222200003333111122220000333311112222000033331111222200003333 
is reversible for any n. The 5-state CA 00000111112222233333444440000011111 
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222223333344444000001111122222333334444400000111112222233333444440 
000011111222223333344444 is also reversible for any n. 

Therefore, we can identify a large set of reversible CAs after using above 
strategies and our algorithm. 

VII CONCLUSION 

In this work, we have developed reachability tree from de Bruijn graph exclu¬ 
sively to test reversibility for 3-neighborhood ci-state periodic boundary CAs. 

We have also reported three greedy strategies for finding a set of reversible d- 
state CAs. We have conjectured that the reversible CAs, defined over infinite 
lattice, are always reversible when the CAs are defined over finite number of 
cells. However, the reversible CAs, defined over finite lattice, may not be 
reversible if lattice size is infinite. 

In future this work can be extended in the following directions: 

• Though, in our process of finding a rule, we have found a large number 
of reversible CAs, but the set is not exhaustive. By further exploration, 
some more rules can be found. 

• This work can be further extended for G?-state CAs (d > 2) under open 
boundary conditions. 
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